A Discriminative Alignment Model for Abbreviation Recognition

Authors

  • Naoaki Okazaki
  • Sophia Ananiadou
  • Jun'ichi Tsujii
Abstract

This paper presents a discriminative alignment model for extracting abbreviations and their full forms appearing in actual text. The task of abbreviation recognition is formalized as a sequential alignment problem, which finds the optimal alignment (the origins of the abbreviation letters) between two strings (the abbreviation and its full form). We design a large number of fine-grained features that directly express the events where letters produce or do not produce abbreviations. We obtain the optimal combination of features on an aligned abbreviation corpus by using the maximum entropy framework. The experimental results show the usefulness of the alignment model and corpus for improving abbreviation recognition.
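
The alignment formulation in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the two features, their weights, and the function names (`candidate_alignments`, `features`, `best_alignment`) are hypothetical, chosen only to show how a log-linear (maximum entropy style) score could rank the possible origins of each abbreviation letter in a full form. In the paper the weights would be learned from an aligned corpus; here they are set by hand.

```python
import math

# Hypothetical hand-set weights; a real model would learn these from data.
HYPOTHETICAL_WEIGHTS = {"word_initial": 2.0, "word_internal": -1.0}


def candidate_alignments(abbr, full, start=0):
    """Yield every left-to-right mapping of abbreviation letters to
    positions in the full form that hold the same letter (case-insensitive)."""
    if not abbr:
        yield ()
        return
    for pos in range(start, len(full)):
        if full[pos].lower() == abbr[0].lower():
            for rest in candidate_alignments(abbr[1:], full, pos + 1):
                yield (pos,) + rest


def features(full, positions):
    """Count two toy features: letters taken from the start of a word
    and letters taken from inside a word."""
    counts = {"word_initial": 0, "word_internal": 0}
    for pos in positions:
        if pos == 0 or full[pos - 1] == " ":
            counts["word_initial"] += 1
        else:
            counts["word_internal"] += 1
    return counts


def best_alignment(abbr, full, weights=HYPOTHETICAL_WEIGHTS):
    """Return (positions, probability) of the highest-scoring alignment
    under a log-linear model, or None if no alignment exists."""
    candidates = list(candidate_alignments(abbr, full))
    if not candidates:
        return None
    scores = [
        sum(weights[name] * value for name, value in features(full, c).items())
        for c in candidates
    ]
    # Normalize over all candidate alignments (softmax).
    z = sum(math.exp(s) for s in scores)
    best = max(range(len(candidates)), key=lambda i: scores[i])
    return candidates[best], math.exp(scores[best]) / z


if __name__ == "__main__":
    print(best_alignment("TTF", "thyroid transcription factor"))
```

With these toy weights, the alignment that takes every letter from the start of a word outranks those that take the second "t" from inside "transcription". Exhaustive enumeration is used only to keep the sketch short; it would not scale to long full forms.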

Related resources

A Discriminative Approach to Japanese Abbreviation Extraction

This paper addresses the difficulties in recognizing Japanese abbreviations through the use of previous approaches, examining actual usages of parenthetical expressions in newspaper articles. In order to bridge the gap between Japanese abbreviations and their full forms, we present a discriminative approach to abbreviation recognition. More specifically, we formalize the abbreviation recognitio...

Robust Approach to Abbreviating Terms: A Discriminative Latent Variable Model with Global Information

The present paper describes a robust approach for abbreviating terms. First, in order to incorporate non-local information into abbreviation generation tasks, we present both implicit and explicit solutions: the latent variable model, or alternatively, the label encoding approach with global information. Although the two approaches compete with one another, we demonstrate that these approaches ...

A Latent Discriminative Model for Compositional Entailment Relation Recognition using Natural Logic

Recognizing semantic relations between sentences, such as entailment and contradiction, is a challenging task that requires detailed analysis of the interaction between diverse linguistic phenomena. In this paper, we propose a latent discriminative model that unifies a statistical framework and a theory of Natural Logic to capture complex interactions between linguistic phenomena. The proposed ...

Markovian Mixture Face Recognition with Discriminative Face Alignment

A typical automatic face recognition system is composed of three parts: face detection, face alignment and face recognition. Conventionally, these three parts are processed in a bottom-up manner: face detection is performed first, then the results are passed to face alignment, and finally to face recognition. The bottom-up approach is one extreme of vision approaches. The other extreme approach...

FACE ALIGNMENT USING BOOSTED APPEARANCE MODEL (Discriminative Appearance Model)

This thesis explores a method of face alignment using the Boosted Appearance Model (BAM). By analogy with the Active Appearance Model (AAM), we call our method the Boosted Appearance Model (BAM) because our appearance model is trained by boosting. In this method, face alignment is done by maximizing the score of a trained two-class classifier that is able to distinguish correct alignment from incorrect alignment, so that the c...



Publication date: 2008